Enhancement automatic speech recognition by deep neural networks

نویسندگان

چکیده

The performance of speech recognition tasks utilizing systems based on deep learning has improved dramatically in recent years by different designs and methodologies. A popular way to boosting the number training data is called Data Augmentation (DA), research shows that using DA effective teaching neural network models how make invariant predictions. furthermore, EM approaches have piqued machine-learning researchers' attention as a means improving classifier performance. In this study, been presenteded unique employs both improve system's prediction accuracy. firstly, reveal an approach vocal tract length disturbance already exists then propose Feature perturbation alternative approach. order amendment sets. This followed integration posterior probabilities obtained from several DNN acoustic trained diverse datasets. study's findings proposed skills improved.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Speech Recognition with Deep Neural Networks for Impaired Speech

Automatic Speech Recognition has reached almost human performance in some controlled scenarios. However, recognition of impaired speech is a difficult task for two main reasons: data is (i) scarce and (ii) heterogeneous. In this work we train different architectures on a database of dysarthric speech. A comparison between architectures shows that, even with a small database, hybrid DNN-HMM mode...

متن کامل

An Automatic Dysarthric Speech Recognition Approach using Deep Neural Networks

Transcribing dysarthric speech into text is still a challenging problem for the state-of-the-art techniques or commercially available speech recognition systems. Improving the accuracy of dysarthric speech recognition, this paper adopts Deep Belief Neural Networks (DBNs) to model the distribution of dysarthric speech signal. A continuous dysarthric speech recognition system is produced, in whic...

متن کامل

Binary Deep Neural Networks for Speech Recognition

Deep neural networks (DNNs) are widely used in most current automatic speech recognition (ASR) systems. To guarantee good recognition performance, DNNs usually require significant computational resources, which limits their application to low-power devices. Thus, it is appealing to reduce the computational cost while keeping the accuracy. In this work, in light of the success in image recogniti...

متن کامل

Deep segmental neural networks for speech recognition

Hybrid systems which integrate the deep neural network (DNN) and hidden Markov model (HMM) have recently achieved remarkable performance in many large vocabulary speech recognition tasks. These systems, however, remain to rely on the HMM and assume the acoustic scores for the (windowed) frames are independent given the state, suffering from the same difficulty as in the previous GMM-HMM systems...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Periodicals of Engineering and Natural Sciences (PEN)

سال: 2021

ISSN: ['2303-4521']

DOI: https://doi.org/10.21533/pen.v9i4.2450